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METHOD FOR MOTION ESTIMATION AND BANDWIDTH 
REDUCTION IN MEMORY AND DEVICE FOR PERFORMING 
5 THE SAME 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to motion estimation in an 
10 image processing system and, more particularly, to motion estimation 
and bandwidth reduction in a memory device 

2. Description of the Related Art 

Motion estimation is the process of looking for the best motion 

15 vector that minimizes the prediction errors. Prediction error is defined 
as the difference between the current macroblock (MB) and the 
prediction MB. In MPEG (Moving Picture Experts Group) codec design, 
motion estimation demands large DRAM (dynamic random access 
memory) bandwidth. This is because a great deal of data must be 

20 transferred from the DRAM to the processors for achieving motion 
estimation. MPEG is the committee that developed international 
standards (such as MPEG-1, MPEG-2 and MPEG-4) that made 
interactive video on CD-Rom and digital TV possible. 

In general, a frame is formed by a large amount of pixels generally 

25 including 8 bits, 16 bits, or 32 bits. For a frame with a size of 640 x 
480 pixels where each pixel has 8 bits, there are a total 640 x 480 x 8 
bits in the frame. If the image should be displayed with a rate of 60 
frames per second, then there are 640 x 480 x 8 x 60 bits (about 147 
Mbits) of data to be displayed in one second. Since calculation of 

30 motion estimation is related to frames of different time periods and an 
algorithm provided must contain several operation steps, the amount of 
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calculation required in processing the image data is very great. If the 
bandwidth of the DRAM is not sufficient, the calculation of the motion 
estimation will be delayed due to insufficient provision of the frame data, 
thereby causing the display speed of video image to slow down 
5 significantly. There is an optional thus a general need in the act for 
motion estimation method and device advantageously overcoming the 
aforementioned shortcomings in the art. In particular, a need exists in 
the art for an optional method and device, for motion estimation and 
bandwidth reduction in a memory device where the amount of 
10 calculations is greatly reduced and the bandwidth of the memory device 
is significantly reduced in optionally providing motion estimation. 

SUMAMRY OF THE INVENTION 

The present invention provides a motion estimation method for 

15 estimating a motion vector between a reference frame and a current 
frame, where each of the reference and current frames are formed by a 
plurality of pixels. The motion estimation method according to a 
preferred embodiment of the invention comprises the following steps 
dividing plurality of reference macroblocks, each comprising a plurality 

20 of adjacent pixels, is divided within each reference frame where a set of 
the reference macroblocks forms a search range. A current macroblock 
comprising a plurality of continuous pixels is divided from each current 
frame where each of said the reference macroblocks and current 
macroblocks has generally the same size and shape with corresponding 

25 pixel distribution. The similarity between one of the reference 
macroblocks and one of the current macroblocks is determined based on 
the averages of every two adjacent pixels in the current macroblock and 
one of the predetermined reference macroblocks, where two pixels for 
averaging is considered one pixel unit. All the similarities for the 

30 predetermined reference macroblocks in the search range are determined 
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for performing a coarse tune operation. A preferred reference 
macroblock is determined from the predetermined reference macroblocks 
based on the similarities for the predetermined references macroblocks. 
The similarities for a second set of predetermined reference macroblocks 
5 around the preferred reference macroblock based on the pixels of the 
current macroblock and the second set of predetermined reference 
macroblocks for performing a fine tune operation. A motion estimation 
of the current frame and the reference frame is determined from the 
similarities for the second set of predetermined reference macroblocks. 

10 Moreover, the present invention provides a motion estimation device 

for reducing memory output bandwidth comprising a memory serving as 
a frame buffer for storing a plurality of frame data, a controller 
connected to the memory for inputting and processing data of a current 
frame and a reference frame and outputting the processed data. The 

15 motion estimation device according to this particular embodiment of the 
invention further comprises a first motion estimation processor 
connected to the controller for coarse-tuning a motion estimation of the 
current frame to the reference frame inputted from the controller, and a 
second motion estimation processor connected to the controller and the 

20 first motion estimation processor for fine-tuning motion estimation of 
the current frame to the reference frame inputted from the controller and 
the first motion estimation processor. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 The various features and advantages of the present invention will be 

more readily understood from the following detailed description when 
read in conjunction with the appended drawings (not necessarily drawn 
to scale) in which: 

Fig. 1 is a block diagram illustrating an architecture and associated 

30 data flow for performing an exemplary operation of motion estimation 
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according to a preferred embodiment of the invention; and 

Fig. 2 is an exemplary schematic view illustrating the processing of 
the image data according to the invention. 



5 DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Fig. 1 is a block diagram that illustrates an exemplary architecture 
and data flow for performing an exemplary operation of motion 
estimation in accordance with a preferred embodiment of the invention. 
The architecture according to this particular embodiment of the 

10 inventory comprises a codec 202 (encoder and decoder) coupled with a 
central processing unit (CPU 205), with a DRAM controller 206 coupled 
with two motion estimation processors ME1 and ME2. The architecture 
according this embodiment further comprises a memory array 204, a TV 
decoder 203 and an image processor 207 coupled with DRAM controller 

15 206. Referring to Fig. 1, the single-chip video codec 202 (encoder and 
decoder) does not act like a core chipset in a personal computer (PC) 
system. For an encoder, it could take the video input from a variety of 
sources, such as TV decoder 202. The memory array 204 serves as a 
frame buffer for codec 202 in an encoding process. After the video 

20 image is compressed in codec 202, it is sent to an external CPU 205. 
The processing at the CPU 205 with respect to the compressed bit stream 
is application-dependent. The CPU 205 can simply store the 
compressed bit stream in a local hard drive (not shown), or a portable 
storage media such as a compact flash card (not shown). The CPU 205 

25 can also send the compressed bit stream to a decoder for real-time 
decoding or for graphical display through wired or wireless 
communication means. Moreover, the CPU 205 can perform system 
multiplexing or de-multiplexing with other audio/video sources. The 
TV decoder 203 takes the compressed bit stream as the input and restores 

30 it to a format suitable for graphical display. The output data of a 
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decoder can be sent to a TV, liquid crystal display (LCD) panel or an 
external CPU. For encoding and decoding, an external CPU 205 simply 
acts as a data transfer interface, with application-depending processing 
work in addition thereto. 
5 In accordance with the image data processing structure and the 

process of data flow the processing according to a preferred embodiment 
of the invention, the memory array 204 is used as a frame buffer for 
reading and writing data. The memory array 204 is connected with a 
DRAM controller 206 that controls the read/write operation of the 

10 memory array 204. The DRAM controller 206 has two outputs 14 and 
15 connected to two motion estimation processors, a first motion 
estimation processor ME1 and a second motion estimation processor 
ME2. According, the output 14 is connected to the first motion 
estimation processor ME1, and the output 15 is connected to the second 

15 motion estimation processor ME2 The first motion estimation 
processor ME1 performs a pixel search with the search range of ±16, ±32, 
or ±64 pixels. Based on the pixel search results from ME1, ME2 
further conducts a local small area search to fine-tune the best motion 
vector search 

20 In the following, a frame of 640 x480 pixels is utilized as an 

example in illustrating the operation of ME1 and ME2. A macroblock 
of 30 x 30 pixels is used herein as a unit for determining the motion 
vector between a reference frame (generally a previous frame) and a 
current frame. The macroblock (MB) in the reference frame is defined 

25 as a reference macroblock and a macroblock (MB) in the current frame is 
defined as a current macroblock. In motion estimation, it is desired to 
determine the motion relationship of the current frame relative to the 
former frame. An algorithm for finding the relationship between the 
reference frame and the current frame is utilized in accordance with the 

30 invention. 
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The pixels in the current frame by Pc, and the pixels in the reference 
frame are denoted by Pr. If the macroblock has a rectangular contour and 
a transversal length of n and a longitudinal length of m, the area of the 
macroblock cover n><m pixels. Moreover, the center of the current 
5 macroblock is at a pixel with the corresponding coordinate of (k, 1). 
The motion estimation processor ME1 then performs the following 
operation, 

n-1 m-1 

SAD = X Z |Pc(k+i,l+j) - Pr(k+i, l+j)| (1) 

i=0 j=0 

where SAD represents the sum of the absolutions of the differences for 
10 the current macroblock and reference macroblock. That is, the 
corresponding pixels in the current frame and the reference frame are 
subtracted from one another, and the absolution values of the differences 
are then summed. The second reference macroblock with a center 
shifting from the original reference macroblock is adopted to obtain 
15 another SAD between the current macroblock and the second reference 
macroblock. For example, the center of the second reference 
macroblock includes a shift of (a, b) from the original center (k, 1). 
Accordingly, the center of the second reference macroblock is (k + a, 1 + 
b), where the SAD is calculated as follows, 

20 

SAD = § § 1Pc(k+i,l+j) - Pr(k+i+a, I+j+b)| (2) 

i=0 j=o • 

The SADs are calculated with coordinates (a, b), with (a, b) located 
in the range A for coarse tuning with a longitudinal length of r and 
transversal length of s. That is, the following calculations are 
25 performed, 

n-1 m-1 

SAD (a, b) = £ £ |Pc(k+U+j) - Pr(k+i+a, l+j+b)| 

i=0 j=0 

-r/2 ^ a ^ r/2, and -s/2 ^ b ^ s/2 

(3) 
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In general, there are two motion estimation processors, one of which 
is used for coarse tuning, and the other for fine tuning. Namely, the 
shifting of the center of a macroblock in coarse tuning is generally larger 
than that of fine tuning. For example, for the first motion estimation 
5 processor ME1, the shifting of the center of a macroblock is one pixel. 
For the second motion estimation process ME2, the shifting of the center 
of the macroblock is 0.5 pixel. 

For each SAD, there is a corresponding motion vector from the 
center of the reference macroblock to the center of the current 

10 macroblock. In the above exemplary calculations, for the current 
macroblock with a center at (k, 1) and a reference macroblock with a 
center at (k+a, 1+b), the motion vector is (a, b) (namely ((k+a, 1+b) - (k, 
1)) corresponding to a SAD(a, b). In accordance with the invention, the 
motion vector with the smallest SAD among the SAD s within coverage A 

15 is selected and provided as motion estimation connecting the reference 
frame and the current frame, namely, 

Motion estimation = Motion vector with min (SADs) (4) 

20 Thus, the first motion estimation processor ME1 provides a first 

motion estimation based on the SADs calculated by coarse tuning. 
Then, in the second motion estimation processor ME2, the same 
procedure from equation (1) to equation (4) are generally repeated for 
the macroblock corresponding to the first motion estimation, as each 

25 shifting becomes smaller than that utilized in the first motion estimation 
processor MEL For example, as the shift is equal to 0.5 pixel, the 
coverage is reduced to an area smaller than (r, s), namely (r\ s'), where 
r' < r, and s' < s. A second motion estimation can similarly be 
determined based on the SAD calculated by fine tuning. The second 

30 motion estimation is adopted as motion estimation between the current 
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macroblock and reference macroblock. 

In determining the SAD in accordance with the invention in the ME1, 
the average of two adjacent pixels is utilized to acquire the SAD. That 
is, the equation 1 is replaced by the following equation: 

n-l m-1 

5 SAD average = X Z |Sc(k+i,l+j) - Sr(k+i, l+j)| (5) 

i=0 j=0 

where Sc(k+i,l+j) = (Pc(k+i,l-i-j) + Pc')/2, and Pc' is a pixel adjacent to 
the Pc(k+U+j). 

In one embodiment according to the motion estimation methodology 
of the invention, each pixel is represented by 8 bits, i.e., each pixel has 

10 at most 256 gray level, then Pc(k+i,l+j) and Pc' include 8 bits. For a 
frame of 640 x480 pixels, if a motion estimation processor is required to 
read all the pixels in the frame, then about 2.4 Mbits (640 x480 x8 bits) 
of data are read. In accordance with the invention, Pc(k+i,l+j) and Pc' 
are summed in the DRAM controller 206 before being read into the 

15 motion estimation processor ME1, and only the sum data is read into 
ME1. The sum of Pc(k+i,l+j) and Pc' accordingly includes 9 bits with 
one carrying bit in the summation operation, as Sc(k+i,l+j) = (Pc(k+i,l+j) 
+ Pc')/2. In a binary operation, the average of two digital data is equal to 
the least significant bit of the sum of the two digital data, as Sc includes 

20 only 8 bits. For pixels Pc(k+i,l+j) and Pc', the first motion estimation 
processor ME1 only needs to read 8 bits instead of 16 bits of the 
Pc(k+i,l+j) and Pc'. That is, only half of the bits of a frame are read 
into first motion estimation processor MEL However, since in 
equation (5), Sc(k+i,l+j) = (Pc(k+i,l+j) + Pc')/2, Sc includes only 8 bits, 

25 the first motion estimation processor ME1 can only read 8 bits for each 
two pixels instead of 16 bits of the Pc and Pc'. Therefore, the 
bandwidth required to transfer the same amount of data at the same time 
interval is substantially reduced, i.e., to at least one half. 

The second motion estimation processor ME2 executes an operation 
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of fine tuning for the current macroblock and reference macroblock 
where motion estimation is acquired in the first motion estimation 
processor ME1. Generally the same processes pursuant to equations (1) 
to (4) are performed, where all the pixels of a frame are read into ME2. 
5 Since the sum of two adjacent pixels are read in ME1, in the second 
motion estimation processor ME2 it is only necessary to read the 
difference of two adjacent pixels and sum values input from the first 
motion estimation processor ME1, the pixel values of a frame can then 
be restored accordingly. 
10 If the pixels to be read are PI and P2 which are adjacent to one 

another, then 

SUM = PI + P2 and 

DIFF -P1-P2 
After ME2 reads the SUM and DIFF, ME2 can restore PI and P2 by: 
15 PI " = (SUM + DIFF) /2; and 

P2 - (SUM - DIFF)/2. 

In accordance with the above mentioned embodiment of the motion 
estimation methodology of the invention, each pixel is represented by 8 
bits. That is, each pixel includes at most 256 gray levels, with 

20 Pc(k+i,l+j) and Pc' having 8 bits. Pc(k+i,l+j) and Pc' are subtracted in 
the DRAM controller 206 before being read into the second motion 
estimation processor ME2. The different data are then read into the 
second motion estimation processor ME2. The DIFF of Pc(k+i,l+j) and 
Pc' accordingly has 9 bits including one sign bit and 8 bits of difference 

25 value. When the first motion estimation processor ME1 reads the sum 
data, the last bit of the sum data is not read into first motion estimation 
processor ME1. This last bit is transferred to ME2 for restoring the 
original data. Consequently, for each two adjacent pixels, there are ten 
bits of data being transferred to the second motion estimation processor 

30 ME2 from the DRAM controller 206, instead of 16 bits as may be 
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required in the prior art. Thus, the bandwidth necessary for 
transferring data from the DRAM controller 206 to the second motion 
estimation processor ME2 is substantially reduced, as much as 62.5 % as 
opposed to that in the prior art. 
5 In accordance with a preferred embodiment, the motion estimation method 

according to the present invention comprises the steps of: (a) displaying an image in 
a plurality of frames corresponding to a plurality of time periods, wherein each 
frame further comprises a given number of pixels and each pixel is represented by 
two-dimensional abscissa and ordinate coordinates, wherein each frame further 

10 comprises at least one macroblock having a lesser number of pixels than the given 
number of pixels of the frame, wherein a current frame is one of the frames in a 
current time period of the plurality of time periods, wherein the current frame 
further comprises at least one current macroblock having a lesser number of pixels 
than those of the current frame, wherein a reference frame is one of the frames in a 

15 time period of the plurality of time periods prior to the current time period for the 
current frame, wherein the reference frame further comprises at least one reference 
macroblock having a lesser number of pixels than those of the reference frame, (b) 
averaging two adjacent pixels of the current macroblock, (c) repeating step (b) for 
all pixels of the current macroblock, (d) averaging two adjacent pixels of the 

20 reference macroblock, (e) repeating step (d) for all pixels of the reference 
macroblock, (f) subtracting the averaged pixels of the current macroblock from the 
corresponding averaged pixels of the reference macroblock resulting in a plurality 
of differences, (g) taking a plurality of absolute values for the differences resulting 
in a plurality of absolutions, (h) summing the absolutions resulting in a SAD (sum 

25 of the absolutions of the differences), (i) shifting abscissa and ordinate coordinates 
of the reference macroblock by corresponding abscissa and ordinate shift values 
resulting in a shifted reference macroblock, (j) averaging two adjacent pixels of the 
shifted reference macroblock, (k) repeating step (j) for all pixels of the shifted 
reference macroblock, (1) subtracting the averaged pixels of the current macroblock 

30 from the corresponding averaged pixels of the shifted reference macroblock 
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resulting in an additional plurality of differences, (m) taking an additional plurality 
of absolute values for the additional differences resulting in an additional plurality 
of absolutions, (n) summing the additional absolutions resulting in an additional 
SAD (sum of the additional absolutions of the additional differences), (o) repeating 
5 steps (i), (k), (1), (m) and (n) resulting in a plurality of additional SADs, and (p) 
taking a minimum SAD out of the SAD and the additional SADs. Further 
according to the method of the invention, the two adjacent pixels of the reference 
macroblock are restored by summing first and second of the two adjacent pixels 
resulting in an adjacent sum, subtracting the first pixel from the second pixel 
10 resulting in an adjacent difference, wherein the first pixel is equal to: 

(the adjacent sum + the adjacent difference)/2 
wherein the second pixel is equal to: , 

(the adjacent sum - the adjacent difference)/2. 

Fig. 2 is an exemplary Schematic view that illustrates the processing 

15 of image data according to the invention. Referring to Fig. 2, there are 

16 pixels, i.e., Y0, Yl, Y2, Y3, Y4, Y5, Y6, Y7, Y8, Y9, Y10, Yll, Y12, 

Y13, Y14, Y15, to be read from the memory. The method for summing 

and taking difference values (i.e., obtaining SUM and DIFF data) can be 

summarized as the following: 

20 Y0 Yl Y2 Y3 Y4 Y5 Y6 Y7 Y8 Y9 Y10 Yll Y12 Y13 Y14 Y15 
™„> SO SI S2 S3 S4 S5 S6 S7 (SUM) 
— > DO Dl D2 D3 D4 D5 D6 D7 (DIFF) 

Each of Yi, Si and Di represents a pixel. For Yi, each pixel includes 
a width of 8 bits. For S and D, the pixel width is 9 bits. 
25 S and D are computed as follows: 

SO - Y0 + Yl, SI - Y2 + Y3, S7 = Y14 + Y15 
DO = Yl - Y0, Dl = Y3 - Y2, D7 - Y15 - Y14 

In the first motion estimation processor ME1, only the SUM part (8 
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bits/pixel, 8 pixel/row) is read from the memory array 204, the 50% of 
the DRAM bandwidth is advantageously conserved in ME1. 

In where at least ME2, only the DIFF part (10 bits/pixel, 8 
pixels/row) is read from memory array 204, where at least 40% of the 
5 DRAM bandwidth in advantageously conserved in ME2. 

Other then estimating based on a frame, the present invention can be 
used to field estimation because in some imaging systems, the images 
are displayed in the order of first displaying even lines and then 
displaying the odd lines. Accordingly, the image similarities between 
10 two even lines are greater than those between two adjacent lines. 
Therefore, an image frame is divided into a frame having two 
sub-images, one being called the even image frame formed by the even 
lines in the image frame and the other being called the odd image frame 
formed by the odd lines in the image frame. A top field motion vector for 
15 the even image and the bottom field motion vector for the odd image are 
calculated by the aforementioned structure and process in accordance 
with the present invention. A field motion vector is then acquired by 
summing the top and bottom field motion vectors. 

In addition, for an image, the present invention advantageously 
20 allows that the frame motion estimation and field motion estimation to 
be determined generally simultaneously, where the one having a 
preferred effect is used as a selected motion estimation of the current 
macroblock. 

Tables 1 and 2 illustrate the simulation results in comparing the 
25 effects of the present invention and the method utilized in the prior art, 
where the overall bandwidth saving for the present invention over the 
prior art is at least 42% to 46%. 
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Table 2 DRAM bandwidth consumption in ME2 implementation. 



In table 1 as shown above, the first column lists the size of the 
5 search range of reference macroblocks, namely ±16 x ±16 pixels, ±32 x 
±16 pixels, and ±64 x±32 pixels. The second column of Table 1 lists 
which kind of motion estimation is executed, where the P frame refers to 
as the frame-based motion estimation being performed on P-VOP, P field 
refers to as the field-based motion estimation being performed on P-VOP, 

10 B frame refers to as the frame-based motion estimation being performed 
on B-VOP, and B field refers to as the field-based motion estimation 
being performed on B-VOP. The third and fourth columns illustrate the 
number of reading bits by the first motion estimation processor ME1 and 
second motion estimation processor ME2 respectively, which is 

15 represented by time periods of a frame. For search ranges of ±16 * ±16 
pixels, and ±32 x±l6 pixels, the motion estimation processors include a 
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buffer with a size of 18 x 8 *16 x 8 bits. For the search range of ±64 x± 
32 pixels, the buffer of the motion estimation processors is 50 x 8 xl6 x 
8 bits. Table 1 shows the results acquired by a conventional motion 
estimation method in the prior art, whereas Table 2 shows the results 
acquired by the method in accordance with a preferred embodiment of 
the invention. The last column of the Table 2 illustrates an exemplary 
bandwidth reduction ratio of the present invention over the conventional 
motion estimation method, namely at least 43% of DRAM bandwidth 
savings. 

In the above embodiment according to the invention, SAD (sum of 
absolute differences) is used as an example for clarifying the present 
invention to those skilled in the art, while other equivalent methods 
which can be used to identify the similarity of two image frames are also 
suitable for use in the present invention. 

In a further embodiment of the present invention, the sum of square 
differences (SSD) of the current macroblock and reference macroblock is 
used in determining the motion vector between the current macroblock 
and reference macroblock. That is, 

n-l m-1 

SSD (a, b) = 2 E (Pc(k+i,l+j) - Pr(k+i+a, 1+j+b)) 2 

i=0 j=0 

-r/2 ^ a ^ r/2, and -s/2 ^ b ^ s/2 
where all the parameters identical to equation (3) have identical 
meanings as those defined in the above equation. 

Moreover, in the present invention, the first motion estimation 
processor ME1 executes the following operation: 

n-l m-1 

SSD (a, b) = 2 S (Sc(k+i,l+j) - Sr(k+i+a, 1+j+b)) 2 

i=0 J=0 

-r/2 ^ a ^ r/2, and -s/2 ^ b < s/2 
where Sc and Sr are identical to those defined above. 

In another embodiment of the present invention, the sum of 
correlations (SC) of the current macroblock and reference macroblock 



15 



are used in determining the motion vector between the current 
macroblock and reference macroblock. That is, 

n-l m-1 

SC(a, b) = £ Z Pc(k+i,l+j) xPr(k+i+a, 1+j+b) 

i=0 j=0 

-r/2 ^ a ^ r/2, and -s/2 ^ b ^ s/2 
5 where all the parameters identical to equation (3) have identical 
meanings as those defined in the above equation. It should be noted that 
as the correlation method is utilized to determine the similarity of two 
image frames, the motion estimation is determined from the motion 
vector with the largest SC. 
10 Similarly, in the present invention, for the first motion estimation 

processor ME1, the ME1 executes the following operation, 

n m 

SC(a, b) = £ Z Sc(k+i f l+j) xSr(k+i+a, 1+j+b) 

i=i j=i 

-r/2 ^ a < r/2, and -s/2 ^ b ^ s/2 
where Sc and Sr are identical to those defined above. 

15 Those skilled in the art should appreciate that although in the above 

description, an image frame is used as an object for motion estimation, 
for MPEG-4 it is allowable that the background and moving object are 
separated in storage. As required, the background image and object are 
assembled as a complete image. It is obvious from the above 

20 description that the present invention can be performed just for the 
moving object so as to acquire the motion estimation of a moving object. 
The same or substantially similar or equivalent algorithm can be applied 
to the moving object, where the contour of the macroblock based on the 
moving object is modified to fit the contour of the search range. For 

25 example, the original equation (3) can be replaced by the following 
equation, 

SAD (a, b) - £ |Pc(k+i,l+j) - Pr(k+i+a, l+j+b)| for(a, b) e H 
wherein C represents the contour of the macroblock based on the moving 
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object and H represents the contour of the search range. Similarly, the 
ME1 performs a sum operation based on these new contours. 

It should be appreciated that in the present invention, the memory 
may be any suitable memory such as dynamic random access memory 
5 (DRAM) or static dynamic random access memory (SDRAM). 

While the above is a detailed description of the specific 
embodiments of the invention, various modifications, alternative 
constructions and equivalents may be used to achieve substantially the 
same effects as the present invention. In particular, the process steps 

10 according to the invention as described herein and above can be 
similarly implemented by other similarly situated process steps 
achieving substantially the same result. Therefore, the above 
description and illustrations should not be taken as limiting the scope of 
the present invention which is defined by the appended claims and their 

15 equivalents. 
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